We can denote . Since Solve for , we have , which minimizes .
2.2 Pythagorean Identity
The cross term vanishes because of .
Now we go back to
3 Multivariate Normal & t-Distribution
Suppose , , denote is defined as t-distribution.
Proposition
Density of is
Proof
By law of total probability since and ,
Note when is large, is close to .
Fact
If has components , then for each , where is the th component of and is the th entry of .
4 Back to Bayesian Inference
Therefore for second model, , the posterior is t-distribution:
In previous notes we have unbiased estimator (note has changed to here), so
When the degrees of freedom of the t-distribution is large, i.e. when is large, we can approximate the second model with
For general setting (as has components), so . It's also clear that , and therefore Similarly with , denote so By fact,
When is large, (3.1) goes to , (3.2) goes to .